Web Page Classification Based on Surrounding Page Model Representing Connection Type and Directory Hierarchy
Similar resources
Automatic Web Page Classification
The aim of this paper is to describe a method for automatically classifying web pages into semantic domains, together with its evaluation. The classification method uses machine learning algorithms and several morphological and semantic text processing tools. In contrast to general text document classification, web document classification often has to deal with short web pages. In this p...
Towards Semantic Web-Based Yellow Page Directory Services
This paper describes the ongoing work of the IWebS (Intelligent Web Services) project, which studies the possibilities of Semantic Web technology in creating a yellow page directory service for end users. We propose an ontology-based mechanism for both advertising and finding the services. The essential parts of the system are ontologies for describing and storing service advertisements, a sem...
Automatic Web Page Classification
To facilitate user browsing of the Web, some websites such as Yahoo! (http://dir.yahoo.com) and the Open Directory Project (http://dmoz.org) manually maintain a hierarchical structure. While manual classification of web pages provides high accuracy, it is very expensive. To automatically include newly emerging pages in these hierarchies, web page classification has become a hot research topic in web infor...
Web Page Downloading and Classification
This paper describes the processes of downloading and classifying Web-based articles in online medical journals as a preliminary step to extracting bibliographic data to populate MEDLINE, the widely used database of the National Library of Medicine (NLM). The processes are combined to develop an automated system named “Web Page Downloading and Classification”. The system downloads the Web page...
Hierarchy in Web Page Similarity Link Analysis
Rather than using traditional text analysis to discover Web pages similar to a given page, we investigate applying link analysis. Because web pages exist in a link-rich environment, this approach has the potential to relate pages by any property imaginable, as links are not restricted to intrinsic properties of the page text or metadata. In particular, while Web page similarity link analysis has been ...
Journal
Journal title: IPSJ Online Transactions
Year: 2009
ISSN: 1882-6660
DOI: 10.2197/ipsjtrans.2.107